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DETAILED ACTION 

Continued Examination Under 37 CFR 1.114 

A request for continued examination under 37 CFR 1.114, including the 
fee set forth in 37 CFR 1 .17(e), was filed in this application after final rejection. 
Since this application is eligible for continued examination under 37 CFR 1.114, 
and the fee set forth in 37 CFR 1.17(e) has been timely paid, the finality of the 
previous Office action has been withdrawn pursuant to 37 CFR 1.114. 
Applicant's submission filed on 02/11/08 has been entered. 

Applicant's Amendment filed 02/1 1/08 has been entered and carefully 
considered. Claims 1, 6, 18, 19, 28, 29, 37, 38, 39, 40, 45, 57, 58, 67, 68, 76, 
81, 93, 94, 103 and 104 have been amended. However, the limitations of the 
amended claims have not been found to be patentable over prior art of record, 
therefore, claims 1-111 are rejected under the new ground of rejection as set 
forth below. 

Claim Rejections - 35 USC § 103 

The following is a quotation of 35 U.S.C. 103(a) which forms the basis for 
all obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented 
and the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

Claims 1-2, 4-5, 13-15, 17, 37, 40-41,43-44, 52-54, 56, 76-77, 79-80, 88- 

90 and 92 , ;u - under 35 U.S.C. 103(a) as being unpatentable over Jain 
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et al. [US. 6,567,980]. 

As to claims 1, 37, 40 and 76, Jain et al. disclose a computer 
implemented method and corresponding apparatus for displaying multimedia 
information comprising the steps/means for displaying a graphical user interface 
(GUI) on the display (column 4, lines 20-25); displaying, in a first area of the 
GUI, a representation of the multimedia information stored by the multimedia 
document (figure 2, 172), the displayed representation of the multimedia 
information comprising a representation of information of the first type and a 
representation of information of the second type (figure 2, the frame 172 
contains both the first type: image and the second type: caption text); displaying 
a first lens covering a first portion of the first area (figure 2, the first lens covers 
the frame 172); displaying, in a second area of the GUI (figure 2, area 176), a 
representation of multimedia information displayed in the first portion of the first 
area (figure 2, one of the frame of area 1 76), the representation of multimedia 
information displayed in the second area comprising a portion of the 
representation of information of the first type covered by the first lens and a 
portion of the representation of information of the second type covered by the 
first lens (figure 2, in the Capture process of the video in area 172, the frame 
172 is captured and extracted from the process, then it is displayed in the 
second display 176. So, one of the multiple keyframes in the second display 
176 is the extracted keyframe of the first display 172, see column 3, line 50 
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through column 4, line 40); Jain et al. fails to clearly teach displaying a first lens 
moveable in response to user input over representations of multimedia 
information displayed in the GUI. However, Jain cited "When video capture is 
initiated by the user, the video encoding process starts at a begin step 762 and 
moves to step 764 wherein the Video Cataloger 1 1 0 (FIG.1 )." (figure 1 1 , lines 
26-30). The user input is over representations of multimedia information 
displayed in the GUI by initiating the video capture by the user. The user puts a 
first lens covering the desired frame 172 by capturing and selecting the desired 
video clip frame 172. If the user wants to capture a next video clip (like frame 
186), the user puts the first lens on the frame 186 by selecting and then 
displaying this frame in the panel 172. The video keeps moving and moving one 
video clip frame to another video clip frame. If the user wants a specific video 
clip frame, the user would put the first lens over the specific video clip frame by 
selecting and capturing the specific video clip frame. The first lens is moveable 
from one video clip frame to another video clip frame depending on user's 
selection of which video clip frame. Therefore, Jain et al. suggest displaying the 
first lens moveable in response to user input over the video. 

It would have been obvious to one of skill in the art, at the time the 
invention was made, to combine a well known implementation of the suggestion 
to Jain. Motivation of the combination is for flexib *\o-^ N ossng the « o cii£ 
frame. 
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As to claims 2, 41 and 77, Jain et al. also disclose displaying a first 
thumbnail image in the first area of the GUI (figure 2, 172), the first thumbnail 
image comprising the representation of information of the first type; and 
displaying a second thumbnail image in the first area of the GUI, the second 
thumbnail image comprising the representation of information of the second 
type (figure 2, column 4, lines 20-65). 

As to claims 4, 43 and 79, Jain et al. show determining a first time and a 
second time associating with the first lens; displaying, in the second area of 
the GUI, a representation of information of the first type occurring between 
the first time and the second time associated With the first lens; and 
displaying, in the second area of the GUI, a representation of information of 
the second type occurring between the first time and the second time 
associated with the first lens (figure 2, 178, column 6, line 40-60 and column 
7, lines 15 through column 8, line 60). 

As to claims 5, 44 and 80, Jain et al. also show receiving user input moving 
the first lens to cover a second portion of the first area; and responsive to the 
user input, automatically changing the information displayed in the second 
area of the GUI such that the representation of multimedia information 
displayed in the second area of the GUI corresponds to the representation of 
multimedia information included in the second portion of the first area (figure 
2, receiving the user input to select another keyframe of the video in the first 
display 172 and then display in the second display 176). 
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As to claims 13, 52 and 88, Jain et al. also teach the information of the first 
type corresponding to video information (column 2, lines 6-24); and the 
representation of the information of the first type comprising one or more 
video keyframes extracted from the video information (column 2, lines 17-22 
and column 4, lines 20-40). 

As to claims 14, 53 and 89, Jain et al. provide the information of the second 
type corresponding to audio information (column 2, lines 18-20); and the 
representation of information of the second type comprises text information 
obtained from transcribing the audio information (column 2, lines 15-25). 
As to claims 15, 54 and 90, Jain et al. also provide the information of the 
second type corresponding to closed-caption (CC) text information; and the 
representation of information of the second type comprises text (column 4, 
lines 20-50). 

As to claims 17, 56 and 92, Jain et al. also provide receiving input indicating 
selection of a portion of the multimedia information occurring between a first 
time and a second time; and performing a first operation on the portion of the 
multimedia information occurring between a first time and a second time 
(column 6, lines 40-60 and column 12, lines 30-50). 

Claims 3, 6-7, 9-12, 18-36, 38-39, 42, 45-46, 48-51, 57-75, 78, 81-82, 84-87 
and 93-1 1 1 rejected under 35 U.S.C. 103(a) as being unpatentable over Jain 
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et al. [US. 6,567,980]. 

As to claims 3, 42 and 78, Jain et al. disclose displaying the portion of 
the representation of information of the first type covered by the first lens in 
the second area of the GUI; and displaying the portion of the representation 
of information of the second type covered by the first lens in the second area 
of the GUI (figure 2, 172 (first area)). Jain et al. fail to clearly teach a first 
panel and a second panel. However, it was well known in the art that the first 
and second panels are displayed in the GUI area (see figure 2, first area 172, 
the first portion contains image, the second portion contains text). It would 
have been obvious to one of ordinary skill in the art at the time of the 
invention to display the first and second information types in the first and 
second panels in the second GUI area in order to provide a good and 
convenient GUI. 

As to claims 7, 46 and 82, Jain et al. fail to clearly teach the third area of the 
GUI. However, it was well known in the art to display the third area in the 
GUI. It would have been obvious to one of ordinary skill in the art at the time 
of the invention to display the third area of the GUI in order to provide a good 
and convenient GUI. 

As to claims 6, 45 and 81 , Jain et al. fail to clearly teach the third area 
of the GUI. However, it was well known in the art to display the third area in 
the GUI. It would have been obvious to one of ordinary skill in the art at the 
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time of the invention to display the third area of the GUI in order to provide a 
good and convenient GUI. 

Jain et al. fails to clearly teach displaying a first lens moveable in 
response to user input over representations of multimedia information 
displayed in the GUI. However, Jain cited "When video capture is initiated by 
the user, the video encoding process starts at a begin step 762 and moves to 
step 764 wherein the Video Cataloger 1 1 0 (FIG.1 )." (figure 1 1 , lines 26-30). 
The user input is over representations of multimedia information displayed in 
the GUI by initiating the video capture by the user. The user puts a first lens 
covering the desired frame 172 by capturing and selecting the desired video 
clip frame 172. If the user wants to capture a next video clip (like frame 186), 
the user puts the first lens on the frame 186 by selecting and then displaying 
this frame in the panel 172. The video keeps moving and moving one video 
clip frame to another video clip frame. If the user wants a specific video clip 
frame, the user would put the first lens over the specific video clip frame by 
selecting and capturing the specific video clip frame. The first lens is 
moveable from one video clip frame to another video clip frame depending 
on user's selection of which video clip frame. Therefore, Jain et al. suggest 
displaying the first lens moveable in response to user input over the video. It 
would have been obvious to one of skill in the art, at the time the invention 
was made, to combine a well known implementation of the suggestion to 
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Jain. Motivation of the combination is for a flexibility of choosing the video 
clip frame. 

As to claims 9, 22, 48, 61, 84 and 97, Jain et al. disclose receiving user 
input moving the first lens to cover a first portion of the first area and 
responsive the user input, automatically changing the information displayed 
in the second area of the GUI (figure 2, the user can selects another 
keyframe in the first display 172 and then display it in the second display 
176). But, Jain et al. fail to clearly teach a third area in order to automatically 
change the information displayed in the third area of the GUI. However, it 
was well known in the art to display the third area in the GUI. It would have 
been obvious to one of ordinary skill in the art at the time of the invention to 
display the third area of the GUI in order to provide a good and convenient 
GUI. 

As to claims 10, 49 and 85, Jain et al. disclose receiving a user input 
moving the first lens to cover a second portion of the first area and 
responsive to the user input, automatically changing the information 
displayed in the second area of the GUI corresponds to the representation of 
multimedia information included in the second portion of the first area (figure 
2, the first lens cover first and second portions of the keyframe in the first 
display 172). Jain et al. fail to clearly teach the information displaying in the 
third area of the GUI. However, it was well known in the art to display the 
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third area in the GUI. It would have been obvious to one of ordinary skill in 
the art at the time of the invention to display the third area of the GUI in order 
to provide a good and convenient GUI. 

As to claim 11, Jain et al. teach displaying a sub-lens covering a portion of 
the first area of the GUI corresponding to the first portion of the second area 
of the GUI covered by the second lens (figure 2, column 6, 42-51, the sub- 
lens is used to select any other keyframe in the first display 172). 

As to claim 12, Jain et al. also teach receiving a user input moving the 
second lens to cover a second portion of the second area; and responsive to 
the user input, automatically changing a position of the sub-lens to cover a 
portion of the first area of the GUI corresponding to the second portion of the 
second area (figure 2, column 6, 42-51 , the sub-lens is used to select any 
other keyframe in the first display 172). 

As to claims 18, 38, 57 and 93, Jain et al. disclose displaying a graphical 
user interface, displaying, in a first area of the GUI, a representation of the 
multimedia information stored by the multimedia document occurring 
between a start time and an end time associated with the multimedia 
document (figure 2, 178 and time line 180). Each of the video images occurs 
between a start time to an end time (column 6, lines 1-25). But, Jain et al. fail 
to clearly teach the step of displaying a first lens emphasized a portion of the 
first area of the GUI, the portion of the first area emphasized by the first lens 
comprising a representation of multimedia information occurring between a 
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first time (t1) and a second time (t2). 

However, it was well known in the art to display a first lens emphasizing a 
portion of the first area of the GUI, the portion of the first area emphasizing 
by the first lens comprising a representation of multimedia information 
occurring between a first time (t1 ) and a second time (t2). 
It would have been obvious to one of ordinary skill in the art at the time of the 
invention to have the portion of the first area emphasized by the first lens 
comprising a representation of multimedia information occurring between a 
first time (t1 ) and a second time (t2) in order to provide a good and 
convenient GUI. 

Jain et al. fails to clearly teach displaying a first lens moveable in 
response to user input over representations of multimedia information 
displayed in the GUI. However, Jain cited "When video capture is initiated by 
the user, the video encoding process starts at a begin step 762 and moves to 
step 764 wherein the Video Cataloger 110 (FIG.1)." (figure 11, lines 26-30). 
The user input is over representations of multimedia information displayed in 
the GUI by initiating the video capture by the user. The user puts a first lens 
covering the desired frame 172 by capturing and selecting the desired video 
clip frame 172. If the user wants to capture a next video clip (like frame 186), 
the user puts the first lens on the frame 186 by selecting and then displaying 
this frame in the panel 172. The video keeps moving and moving one video 
clip frame to another video clip frame. If the user wants a specific video clip 
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frame, the user would put the first lens over the specific video clip frame by 
selecting and capturing the specific video clip frame. The first lens is 
moveable from one video clip frame to another video clip frame depending 
on user's selection of which video clip frame. Therefore, Jain et al. suggest 
displaying the first lens moveable in response to user input over the video. 

It would have been obvious to one of skill in the art, at the time the 
invention was made, to combine a well known implementation of the 
suggestion to Jain. Motivation of the combination is for ovr \ o 
choosing the video dip frame. 

As to claims 19, 58 and 94, In light of rejection of the third area, Jain et 
al. show displaying a second lens emphasizing a portion of the second area 
of the GUI, the portion of the second area emphasized by the second lens 
comprising a representation of multimedia information occurring between a 
third time (t3) and a fourth time (t4) (each video image has different start-end 
time. So, a first image has its start-end time (t1-t2) and a second image has 
its start-end time (t3-t4). 

Jain et al. fail to clearly teach the step of displaying, in a third area of the 
GUI, the representation of multimedia information occurring between t3 and 
t4, the representation of multimedia information displayed in the third area 
comprising a representation of information of the first type occurring between 
t3 and t4 and a representation of information of the second type occurring 
between t3 and t4. 
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However, it was well known in the art to display the third area in the GUI. It 
would have been obvious to one of ordinary skill in the art at the time of the 
invention to display the third area of the GUI in order to provide a good and 
convenient GUI. 

Jain et al. fails to clearly teach displaying a first lens moveable in 
response to user input over representations of multimedia information 
displayed in the GUI. However, Jain cited "When video capture is initiated by 
the user, the video encoding process starts at a begin step 762 and moves to 
step 764 wherein the Video Cataloger 1 1 0 (FIG.1 )." (figure 1 1 , lines 26-30). 
The user input is over representations of multimedia information displayed in 
the GUI by initiating the video capture by the user. The user puts a first lens 
covering the desired frame 172 by capturing and selecting the desired video 
clip frame 172. If the user wants to capture a next video clip (like frame 186), 
the user puts the first lens on the frame 186 by selecting and then displaying 
this frame in the panel 172. The video keeps moving and moving one video 
clip frame to another video clip frame. If the user wants a specific video clip 
frame, the user would put the first lens over the specific video clip frame by 
selecting and capturing the specific video clip frame. The first lens is 
moveable from one video clip frame to another video clip frame depending 
on user's selection of which video clip frame. Therefore, Jain et al. suggest 
displaying the first lens moveable in response to user input over the video. 
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It would have been obvious to one of skill in the art, at the time the 
invention was made, to combine a well known implementation of the 
suggestion to Jain. Motivation of the combination is for o , N \ v 
choosing the video dip frame. 

As to claims 20, 59 and 95, Jain et al. fail to clearly teach a fifth and 
sixth time, however, in light of rejection of t1 , t2, t3 and t4, it was well known 
in the computer art to represent of multimedia information occurring between 
a t5 and t6. It would have been obvious to one of ordinary skill in the art at 
the time of the invention to display the third area of the GUI in order to 
provide a good and convenient GUI. 

As to claims 21, 23, 60, 62, 96 and 98, Jain et al. fail to clearly teach the 
step of changing the position of the second lens in response to user input 
such that the second lens emphasizes a portion of the second area of the 
GUI comprising a representation of multimedia information occurring 
between a fifth time (t5) and a sixth time (t6). However, in light of rejection of 
the third area and t5 and t6, it would have been obvious to a person of 
ordinary skill in the art at the time of the invention to change the position of 
the second lens in response to user input such that the second lens 
emphasizes a portion of the second area of the GUI comprising a 
representation of multimedia information occurring between a fifth time (t5) 
and a sixth time (t6). The motivation would have been to provide a 
convenient and good interface. 
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As to claims 24, 63 and 99, Jain et al. show the information of the first type 
being video information; the information of the second type being audio 
information; the representation of the information of the first: type comprising 
one or more video keyframes extracted from the video information; and the 
representation of information of the second type comprising text information 
obtained from transcribing the audio information (column 2, lines 6-24, 
column 2, lines 17-22 and column 4, lines 20-40 and column 2, lines 15-25). 
As to claims 25, 64 and 100, Jain et al. also show the information of the first 
type being video information; the information of the second type being 
closed-caption (CC) text information; the representation of the information of 
the first type comprising one or more video keyframes extracted from the 
video information; and the representation of the information of the second 
type comprising text information included in the CC text information (column 
4, lines 20-50). 

As to claims 26, 65 and 101, Jain et al. disclose receiving information 
indicating a first topic; and analyzing the multimedia information stored in the 
multimedia document to identify one or more locations in the multimedia 
information that are relevant to the first topic (column 9, line 52 through 
column 10, line 21); wherein displaying the representation of the multimedia 
information stored by the multimedia document occurring between ts and te 
in the first area of the GUI comprises highlighting the one or more locations 
in the multimedia information displayed in the first area of the GUI (column 8, 
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lines 23-65); and wherein displaying the representation of multimedia 
information occurring between t1 and t2 in the second area of the GUI 
comprises highlighting the one or more locations in the multimedia 
information that occur between times t1 and t2 (column 6, lines 28-67). 
As to claims 27, 66 and 102, Jain et al. teach receiving input indicating 
selection of a portion of the multimedia information occurring between a 
selection start time and a selection' end time; and performing a first operation 
on the portion of the multimedia information occurring between the selection 
start time and the selection end time (column 6, lines 40-60 and column 12, 
lines 30- 50). 

As to claims 28, 39, 67 and 103, Jain et al. disclose displaying a graphical 
user interface; displaying, in a first area of the GUI, a representation of the 
multimedia information stored by the multimedia document occurring 
between a start time and an end time associated with the multimedia 
document (figure 2, 178 and time line 180). Each of the video images occurs 
between a start time to an end time (column 6, lines 1-25). Jain et al. also 
show displaying, in first section of a first area of the GUI, a first set of one or 
more video keyframes extracted from the video information occurring 
between a start time and an end time associated with the multimedia 
document (column 6, lines 42-60); in light of rejection of t1 and t2, it would 
have been obvious to a person of ordinary skill in the art at the time of the 
invention to display a first lens emphasizing a portion of the first section of 
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the first area occurring between a first time and a second time and a portion 
of the second section of the first area occurring between t1 and t2, the 
emphasized portion of the first section of the first area comprising a second 
set of one or more video keyframes extracted from the video information. The 
motivation would have been to provide a convenient and good interface. 

Jain et al. fails to clearly teach displaying a first lens moveable in 
response to user input over representations of multimedia information 
displayed in the GUI. However, Jain cited "When video capture is initiated by 
the user, the video encoding process starts at a begin step 762 and moves to 
step 764 wherein the Video Cataloger 110 (FIG.1)." (figure 11, lines 26-30). 
The user input is over representations of multimedia information displayed in 
the GUI by initiating the video capture by the user. The user puts a first lens 
covering the desired frame 172 by capturing and selecting the desired video 
clip frame 172. If the user wants to capture a next video clip (like frame 186), 
the user puts the first lens on the frame 186 by selecting and then displaying 
this frame in the panel 172. The video keeps moving and moving one video 
clip frame to another video clip frame. If the user wants a specific video clip 
frame, the user would put the first lens over the specific video clip frame by 
selecting and capturing the specific video clip frame. The first lens is 
moveable from one video clip frame to another video clip frame depending 
on user's selection of which video clip frame. Therefore, Jain et al. suggest 
displaying the first lens moveable in response to user input over the video. 
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It would have been obvious to one of skill in the art, at the time the 
invention was made, to combine a well known implementation of the 
suggestion to Jain. Motivation of the combination is for o , N \ v 
choosing the video dip frame. 

As to claims 29, 68 and 104, in light of rejection above about the 
displaying a second lens emphasizing a portion of the first section of the 
second area and a portion of the second section of the second area; and the 
video keyframes, it was well known in the computer art to have the step of 
"the emphasized portion of the first section of the second area comprising a 
third set of one or more video keyframes extracted from the video information 
occurring between a third time (t3) and a fourth time (t4) and the third set of 
the video keyframes being a subset of the second set of one or more video 
keyframes. It would have been obvious to one of ordinary skill in the art at 
the time of the invention to display the emphasized portion of the first section 
of the second area comprising a third set of one or more video keyframes 
extracted from the video information occurring between a third time (t3) and a 
fourth time (t4) in order to provide a good and convenient GUI. 

Jain et al. fails to clearly teach displaying a first lens moveable in 
response to user input over representations of multimedia information 
displayed in the GUI. However, Jain cited "When video capture is initiated by 
the user, the video encoding process starts at a begin step 762 and moves to 
step 764 wherein the Video Cataloger 110 (FIG.1 )." (figure 1 1 , lines 26-30). 



Application/Control Number: 10/081,129 
Art Unit: 2179 



Page 



The user input is over representations of multimedia information displayed in 
the GUI by initiating the video capture by the user. The user puts a first lens 
covering the desired frame 172 by capturing and selecting the desired video 
clip frame 172. If the user wants to capture a next video clip (like frame 186), 
the user puts the first lens on the frame 186 by selecting and then displaying 
this frame in the panel 172. The video keeps moving and moving one video 
clip frame to another video clip frame. If the user wants a specific video clip 
frame, the user would put the first lens over the specific video clip frame by 
selecting and capturing the specific video clip frame. The first lens is 
moveable from one video clip frame to another video clip frame depending 
on user's selection of which video clip frame. Therefore, Jain et al. suggest 
displaying the first lens moveable in response to user input over the video. 

It would have been obvious to one of skill in the art, at the time the 
invention was made, to combine a well known implementation of the 
suggestion to Jain. Motivation of the combination is for - ^ ■> u ^ 
choossng the video clip frame. 

As to claims 30, 69 and 105, in light of rejection of claim 29, Jain et al. 
also teach outputting video information starting from t3 or from t4 or from a 
time between t3 and t4 in a first section of a third area of the GUI (column 6, 
lines 1- 55 and column 13, lines 35-67); displaying text information 
corresponding to the information of the first type occurring between t3 and t4 
in a second section of the third area of the GUI (column14, lines 26-67). It 
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would have been obvious to one of ordinary skill in the art at the time of the 
invention to display the emphasized portion of the first section of the second 
area comprising a third set of one or more video keyframes extracted from 
the video information occurring between a third time (t3) and a fourth time 
(t4) in order to provide a good and convenient GUI. 

As to claims 31, 70 and 106, Jain et al. show the information of the first 
type being audio information (column 2, lines 10-25); and the text information 
corresponding to the information of the first type being obtained from 
transcribing the audio information (column 4, lines 40-67). 
As to claims 32, 71 and 107, Jain et al also show the information of the first 
type being closed-caption text information, and the text information 
corresponding to the information of the first type being extracted from the CC 
text information (column 4, lines 20-60 and column 8, line 50 through column 
9, line 20). 

As to claims 33, 72 and 108, Jain et al. fail to clearly teach a first set of one 
or more slides extracting from the slides information occurring between ts 
and te. However, Jain et al. show the images instead. It was well known in 
the multimedia information that images can be displayed instead. It would 
have been obvious to one of ordinary skill in the art at the time of the 
invention to display the images of the GUI in order to provide a good and 
convenient GUI. 

As to claims 34, 73 and 109, the claim is analyzed previously discuss with 
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respect claims 29 and 33. 

As to claims 35, 74 and 110, in light of rejection above, Jain et al. fail to 
clearly teach the step of "a first set of one or more whiteboard images image 
extract from the whiteboard images information occurring between ts and te". 
However, the whiteboard images were well known in the multimedia 
information environment. It would have been obvious to one of ordinary skill 
in the art at the time of the invention to display the whiteboard images of the 
GUI in order to provide a good and convenient GUI. 
As to claims 36, 75 and 111, the claim is analyzed previously discuss with 
respect claims 28 and 35. 

As to claim 50, Jain et al. teach displaying a sub-lens covering a portion of 
the first area of the GUI corresponding to the first portion of the second area 
of the GUI covered by the second lens (figure 2, column 6, 42-51 , the sub- 
lens is used to select any other keyframe in the first display 172). 
As to claim 51, Jain et al. also teach receiving a user input moving the 
second lens to cover a second portion of the second area; and responsive to 
the user input, automatically changing a position of the sub-lens to cover a 
portion of the first area of the GUI corresponding to the second portion of the 
second area (figure 2, column 6, 42-51 , the sub-lens is used to select any 
other keyframe in the first display 172). 

As to claim 86, Jain et al. teach displaying a sub-lens covering a portion of 
the first area of the GUI corresponding to the first portion of the second area 
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of the GUI covered by the second lens (figure 2, column 6, 42-51 , the sub- 
lens is used to select any other keyframe in the first display 172). 
As to claim 87, Jain et al. also teach receiving a user input moving the 
second lens to cover a second portion of the second area; and responsive to 
the user input, automatically changing a position of the sub-lens to cover a 
portion of the first area of the GUI corresponding to the second portion of the 
second area (figure 2, column 6, 42-51 , the sub-lens is used to select any 
other keyframe in the first display 172). 

Allowable Subject Matter 

Claims 8, 16, 47, 55, 83 and 91 are objected to as being dependent upon 
a rejected base claim, but would be allowable if rewritten in independent form 
including all of the limitations of the base claim and any intervening claims. 

Response to Arguments 

Applicant's arguments with respect to the feature of "displaying a first 
lens moveable in response to user input over representations of multimedia 
information displayed in the GUI." have been considered but are moot in view of 
the new ground(s) of rejection. 
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Applicant has argued that Jain fails to teach "displaying, in a second area 
of the GUI, a representation of multimedia information displayed in the first 
portion of the first area, the representation of multimedia information displayed 
in the second area comprising a portion of the representation of information of 
the first type covered by the first lens and a portion of the representation of 
information of the second type covered by the first lens." 

However, the examiner respectfully disagrees with the argument. Jain 
discloses a panel 172 displaying the live video being digitized, with play, 
stop. ..The frame portions are displayed in section 184 such as frames 186, 188, 
1 90, 1 92, 1 94 that are video clips of the live video. A user captures a desired 
portion (frame portion 172) of the live video including the frame portions of 
video clips. So, the live video includes video clips such as portions 172, 186, 
188, 190, 192, 194. The user puts a first lens covering the desired frame 172 by 
capturing and selecting the desired video clip frame 172. If the user wants to 
capture a next video clip (like frame 186), the user puts a second lens on the 
frame 186 by selecting and then displaying this frame in the panel 172. 
Therefore, Jain discloses displaying the first lens covering a portion of the first 
area. Jain discloses multiple frame portions in the second area 176. Each 
portion includes two types of information: image and text. Text is displayed at 
the bottom of each image in the second area 176. Applicant's attention is 
directed to image 202. The text information is displayed at the bottom of the 
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image 202. It is clearly that Jain discloses a second area that presenting 
portions of the first and second types of information covered by the first lens. 

Applicant also argued Jain fails to provide any active correlation between 
the act of displaying frames in the preview window and the act of displaying 
information in the other portions of the GUI. 

However, the Applicant has argued the limitations (any active correlation 
between the act of displaying frames in the preview window and the act of 
displaying information in the other portions of the GUI) which are not recited in 
the claim. During patent examination, the pending claims must be "given >their< 
broadest reasonable interpretation consistent with the specification." > In re 
Hyatt, 211 F.3d 1367, 1372, 54 USPQ2d 1664, 1667 (Fed. Cir. 2000). Although 
the claims are interpreted in light of the specification, limitations from the 
specification are not read into the claims. See In re Van Geuns, 988 F.2d 1 181 , 
26 USPQ2d 1057 (Fed. Cir. 1993). 

Applicant always has the opportunity to amend the claims during prosecution, 
and broad interpretation by the examiner reduces the possibility that the claim, 
once issued, will be interpreted more broadly than is justified. In re Prater, 415 
F.2d 1393, 1404-05, 162 USPQ 541, 550-51 (CCPA 1969). 
Also, the Applicant does not specify the invention in the claimed language. The 
claimed language itself "displaying, in a second area of the GUI, a 
representation of multimedia information displayed in the first portion of the first 
area, the representation of multimedia information displayed in the second area 
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comprising a portion of the representation of information of the first type 
covered by the first lens and a portion of the representation of information of the 
second type covered by the first lens" is still broad. It does not specify the 
present specification. 

Applicant has also argued that Jain fails to teach displaying a first thumbnail 
image in the first area of the GUI and displaying a second thumbnail image in 
the first area of the GUI. 

However, the examiner respectfully disagrees with the above argument. The 
thumbnail (the current frame 172) in the first area (area 172) comprises a first 
thumbnail image (which is the video picture) and a second thumbnail image 
(which is the text information underneath the video picture. The text information 
is similar to a text in figure 3, 314, 322 of the present invention). So, the first 
and second thumbnail images are displayed in the first area of the GUI (figure 
2, area 172). 

Conclusion 

Any inquiry concerning this communication or earlier communications 
from the examiner should be directed to Mylinh Tran. The examiner can 
normally be reached on Mon - Thu from 7:00AM to 3:00PM at 571-272-4141 . 

If attempts to reach the examiner by telephone are unsuccessful, the 
examiner's supervisor, Weilun Lo, can be reached at 571-272-4847. 

The fax phone numbers for the organization where this application or 
proceeding is assigned are as follows: 

571-273-8300 
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Information regarding the status of an application may be obtained from 
the Patent Application Information Retrieval (PAIR) system. Status information 
for published applications may be obtained from either Private PAIR or Public 
PAIR. Status information for unpublished applications is available through 
Private PAIR only. For more information about the PAIR system, see 
http://pair-direct.uspto.gov. Should you have questions on access to the Private 
PAIR system, contact the Electronic Business Center (EBC) at 866-217-9197 
(toll-free). 

Mylinh Tran 
Art Unit 2179 



/Weilun Lo/ 

Supervisory Patent Examiner, Art Unit 2179 



